Non-Stationary Bandits with Habituation and Recovery Dynamics

نویسندگان

  • Yonatan Mintz
  • Anil Aswani
  • Philip Kaminsky
  • Elena Flowers
  • Yoshimi Fukuoka
چکیده

Yonatan Mintz, Anil Aswani, Philip Kaminsky Department of Industrial Engineering and Operations Research, University of California, Berkeley, CA 94720, {aaswani,kaminsky,ymintz}@berkeley.edu Elena Flowers Department of Physiological Nursing, School of Nursing, University of California, San Francisco, CA 94143, [email protected] Yoshimi Fukuoka Department of Physiological Nursing & Institute for Health & Aging, School of Nursing, University of California, San Francisco, CA 94143, [email protected]

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving Online Marketing Experiments with Drifting Multi-armed Bandits

Restless bandits model the exploration vs. exploitation trade-off in a changing (non-stationary) world. Restless bandits have been studied in both the context of continuously-changing (drifting) and change-point (sudden) restlessness. In this work, we study specific classes of drifting restless bandits selected for their relevance to modelling an online website optimization process. The contrib...

متن کامل

Discrepancy-Based Algorithms for Non-Stationary Rested Bandits

We study the multi-armed bandit problem where the rewards are realizations of general nonstationary stochastic processes, a setting that generalizes many existing lines of work and analyses. In particular, we present a theoretical analysis and derive regret guarantees for rested bandits in which the reward distribution of each arm changes only when we pull that arm. Remarkably, our regret bound...

متن کامل

Stochastic Bandits with Pathwise Constraints

We consider the problem of stochastic bandits, with the goal of maximizing a reward while satisfying pathwise constraints. The motivation for this problem comes from cognitive radio networks, in which agents need to choose between different transmission profiles to maximize throughput under certain operational constraints such as limited average power. Stochastic bandits serve as a natural mode...

متن کامل

Effects of stocking density, feeding technique and vitamin C supplementation on the habituation on dry feed of pikeperch (Sander lucioperca) pond reared juveniles

Influence of three different stocking densities, vitamin C supplementation of Daphnia spp. and feeding practice (i.e. mechanical, hand) on the success of dry feed habituation of pond reared pikeperch juveniles was investigated through one month trial. Pond reared pikeperch juveniles were harvested 42 days post-fertilisation (mean individual weight 1.1 ± 0.3g) and stocked into the experim...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1707.08423  شماره 

صفحات  -

تاریخ انتشار 2017